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Abstract 

We present a model to describe the inbound air traffic over a con- 
gested hub. We show that this model gives a very accurate description of 
the traffic by the comparison of our theoretical distribution of the queue 
with the actual distribution observed over Heathrow airport. We discuss 
also the robustness of our model. 

1 INTRODUCTION 

Airport congestion is a persistent phenomena in air traffic. Air traffic con- 
gestion is significant even if the principal airports in Western and Central 
Europe are treated as "fully coordinated"]^ meaning essentially that the 
number of flights that can be scheduled there per hour (or other unit of 
time) is not allowed to exceed the "declared capacity" of the airports In 
2011, the average additional Arrival Sequencing and Metering Area (ASM A) 
tim^Jat the top 30 european airports amounted to 2.9 minutes per arrival 
increasing by +5% with respect to the previous year. On this statistic, London 
Heathrow is a clear outlier, having by far the highest level of additional time 
within the last 40NM with 8.2 minutes per arrival, followed by Frankfurt and 
Madrid [9]. Similar situations occur in the US (4]. 

Quoting the 2011 Performance Review Report: "Airports are key nodes of 
the aviation network and airport capacity is considered to be one of the 
main challenges to future air traffic growth. This requires an increased focus 
on the integration of airports in the ATM network and the optimisation of 
operations at and around airports". 

Several approaches have been proposed to mitigate congestion and resolve 
demand-capacity imbalances. At operational level (short-term), these ap- 
proaches consider the operational adjustment of air traffic flows to match 
available capacity. So far, the most popular approach in resolving these 
short-term periods of congestion has shown to be the allocation of ground 
delays IjTjp. The Ground Holding Problem considers the development of 
strategies for allocating ground delays to aircraft, and has received consid- 
erable attention Ifi6l [7| [?} fTl. However, these air traffic flow management 

1 In the U.S., scheduling limits are applied only to New York region airports, Washington /Reagan, 
and Chicago/O'Hare airport, under the High Density Rule (HDR). 

2 The Arrival Sequencing and Metering Area (ASMA) is the airspace within a radius of 40NM 
around an airport. The ASMA additional time is a proxy for the average arrival runway queuing 
time on the inbound traffic flow, during times when the airport is congested. 
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strategies might be suboptimal because they do not capture the inherent 
unpredictability of arrivals at airports. [4] showed that changes in the current 
practice for setting airport arrival rates can lead to significant benefits in 
terms of additional ASM A times. 

In view of the current situation, it is extremely important to have a reliable 
tool to measure and forecast congestion in the Air Traffic System. However, 
in developing such a tool there are some issues to address. First of all, 
the stochastic models developed so far to describe air traffic congestion are 
not reliable. The interarrival times between two consecutive arrivals at an 
airport seem to be distributed exponentially, leading to the hypothesis that 
the arrivals are Poissonian JiT) . Nonetheless, the results of the classical 
queueing theory with Poisson arrivals do not fit with the observed data; see 
Figure [3] below. 

A second issue regards the validation of the stochastic processes. In fact, it 
is not easy to quantify the amount of congestion, because it is difficult to 
evaluate the number of aircrafts in queue. 

In this paper we propose a straightforward description of the queueing 
process at a very congested airport. We study the inbound air traffic at 
Heathrow airport. We compute the time spent in queue calculating the 
difference between the actual and the minimal time spent in the vicinities 
of the airport. The extra time is considered to be time spent in queue (the 
details of this simple computation are in Section^. This procedure solves 
the problem of evaluating the distribution of the queue in this very congested 
case. 

We present a quite natural mathematical model for the arrival process of the 
aircraft, that is very different from the Poisson process but has a distribution 
of the interarrival times very close to the exponential. The idea of the 
process, that will be described in details in the next section, is easy: we 
start with a deterministic schedule, organized in such a way that the aircraft 
should arrive in a deterministic and homogeneous way (say with a constant 
interarrival time similar to the time needed for a landing), and then we add 
an independent random variable to each of this scheduled arrivals. The 
resulting list of times of arrival is then mixed up by the random delay. The 
process obtained in this way has a long history iTJl. It is easy to study 
numerically but quite difficult to treat from a mathematical point of view, 
though significative progresses have been recently made [ ITirTol . 
The simulations we present in this paper show the following facts: 

1. The fit with the real data over Heathrow airport is really excellent, 
and it is incomparably better than the analogous calculation assuming 
Poissonian arrivals. This will be the subject of the discussion presented 
in Section [4] 

2. The distribution of the random delays added to each arrival time has a 
very small impact on the distribution of the queue. The only relevant 
parameter is the variance of the random delays. In order to have a 
reliable forecast of the traffic over a congested hub one can use the 
simplest distribution, e.g. the uniform one. This point will be discussed 
in details in Section [5] 

2 DESCRIPTION OF THE ARRIVAL PROCESS 

The Pre-Scheduled Random Arrivals (PSRA) process is defined as follows. 
Denoting with \ the expected interarrival time between two consecutive 
aircrafts, the actual arrival time of the z'-th aircraft (f; 6 R) is defined by 

U = \ + Si i e z (1) 
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where £,'s are i.i.d. continuous random variables with probability density 
fx (t) and variance a 2 . Without loss of generality we can assume E(£/) = 0, 
as E(£j) 7^ affects only the initial configuration of the system. 
When <t is large the process defined in Q weakly converges to the Poisson 
process, in particular it is possible to prove that its generating function 
tends pointwise to the generating function of the Poisson process [n]. This 
property also holds for a variant of the PSRA process that takes into account 
the possibility of flights' cancellation as in [4]. This variant is an independent 
thinning version of this process, i.e. a process in which each arrival has 
an independent probability 1 — 7 to be cancelled (and the complementary 
probability 7 to be a true arrival). In Air Traffic Management (ATM) contexts 
it is natural to couple a PSRA arrival process with a deterministic service 
process with expected service rate A. In this case the traffic intensity of the 
thinned process is clearly p = 7. 

Although both the PSRA process and its thinned version are very similar to 
the Poisson process for large a, they present a crucial difference with the 
latter: as soon as o~ remains finite, the PSRA process is negatively autocor- 
related. The covariance Cov(ni, n 2 ) between the number of arrivals at two 
consecutive time periods n\ and n 2/ where n\ is the number of arrivals in 
(t,t + T] and n 2 is the number of arrivals in (t + T,t + 2T], is given by 

Cov(m,n 2 ) =E(mn 2 )-E(n 1 )E(n 2 ) = -£pf%f + T)p5 <r) (f + T,f + 2T) 

i 

where p- (t\, t 2 ) is the probability that the f-th aircraft arrives in the interval 
[fi,f2]; for details, see Ril . A negative covariance means that n% and n 2 are 
inversely correlated, thus a congested time slot is likely to be followed or 
preceded by a slot with less-than-expected arrivals. Moreover, this proves 
that the hypothesis of independence for n\ and n 2 , numbers of arrivals in 
different time slots, is not correct, unless we are in the limit <r — >■ 00. 
Simulations show that if we neglect this correlation and we try to describe 
a queueing system with independent interarrival times, we obtain a gross 
overestimate for the average length of the queue. This error is particularly 
rough when the system is congested (traffic intensity p near to 1), see Figure^ 
below. 

3 REAL DATA FROM LONDON HEATHROW AIRPORT 

The London Heathrow airport is served by two parallel independent runways. 
The runways are used in a separate mode, meaning that one runway is 
used only for departures and the other is used only for arrivals. Aircraft 
occasionally are cleared to land on the departure runway to minimise delay 
subject to certain delay criteria. According to the ICAO Standard Arrival 
Chart, Heathrow has 28 STAR (standard arrival route) starting from 11 entry 
points; see Figure [1] 

In the intent of this analysis, actual data are used ranging from July 20 to 
July 30, 2010. This big database include 7,140 flights. A first analysis of these 
data shows that most of the flights (about 70%) enter from three entry points: 
LOGAN, ALESO and NUGRA and pass on STAR LAM 3 A, BIG 3B and 
BNN lB, respectively. Figure [2] shows a qualitative layout of the incoming air 
traffic in the aforesaid lapse of time. 

We limited ourselves to the three entry points mentioned above, and starting 
from this restricted database we observed that the traffic intensity of the 
system was not completely constant during the day but in the two slots from 
6:00 a.m. to 10:30 a.m. and from 4:00 p.m. to 8:00 p.m. For each flight landing 
in one of these two time intervals the entrance time in the system has been 
subtracted to the landing time. Let us call this difference approaching time. To 
obtain the amount of time each flight has been waiting in queue, we subtract 
from the approaching time of any aircraft entering from a given entry point 
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Figure v. A sketch of the 28 STAR of the London Heathrow Airport. 




Figure 2: Qualitative layout of the incoming air traffic over the London 
Heathrow Airport during the time lapse under examination. 
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the minimun approaching time from the same entry point. Note that the 
service time is nearly deterministic. This implies that the length of the queue 
that a given aircraft had to wait is proportional to the time spent in queue 
by that aircraft. Hence, the empirical distribution of the time spent in queue 
is equivalent to that of the queue lenght. Eventually, we have gathered the 
data from the three distinct queues and we have normalized the resulting 
histogram, which can be seen in Figure [3] All in all, the final number of 
flights considered was 4139. 

From the data we could also see that in the period above the average number 
of flights entering in the airport fly area per hour is 40, while the maximum 
number of flights landing on a single runway is 41 per hour. Therefore, 
in this analysis we will take the following measured value for the traffic 
intensity p, 

average number of arrivals per hour 40 „ „_ 

P = : i 7T T- i = TT = 0.976 

maximum number of landings per hour 41 

4 SIMULATIONS OF PSRA AND COMPARISON WITH HEATHROW DATA 

The data discussed in Section [3] may be used to get an idea of the high 
reliability of the PSRA model. In Figure [3^— [3ji the empirical distribution 
derived from actual data is compared with the empirical distribution obtained 
by simulation of models of interest. Subfigures a to d are displayed clockwise 
starting top left. 

Figure [3^ refers to the M/D/i case. Poissonian arrivals are de facto a 
standard assumption in many works and studies in the ATM field [H [5} 13 
[14] I17] but they give not a satisfactory prediction of the tail of the queue 
distribution whatsoever. Figure [^j? and j^Jf refer to the PSRA with a = ^ and 
17 ~ T' respectively- These values are reasonable guesses of the order of the 
actual standard deviation. We would need more data to validate this claim. 
Anyway, we clearly see that the PSRA model gives a very good accordance 
with empirical data, but to go one better and match what actually happens 
in real life operations we tried to introduce some randomness on the service 
time, modeled as a triangular random variable with mean i and mode 
The introduction of a such small source of randomness is completely 
legitimate by the need of the service providers to adjust the sequence of the 
arrivals on demand. The variant of PSRA obtained this way is shown in 
Figure ^p. We want to outline here that the simulations have been performed 
using a straightforward Python code. This is an important features of this 
model: it can be easily simulated. 

In Table [1] we show a measure of the accordance of the observed data with 
the simulated ones using Total Variation and Hellinger distance. Note that 
the distance in the case of memoryless arrivals is an order of magnitude 
larger than the distances obtained by PSRA arrivals. 



Model 




Total Variation distance 


Hellinger distance 


M/D/i 

Uniform Delays 
Uniform Delays 
Uniform Delays 


3 
T 
5 


0.41067 
0.07516 
0.03938 
0.05723 


0.43903 
0.08133 
0.04565 
0.05814 



Table 1: Total Variation and Hellinger distances between the empirical distri- 
bution of Heathrow data and the distributions considered in Figure [3] 



3 a = 20/a am j deterministic service time 

4 u — 30/a and deterministic service time 

5 u = 20/a and service times modeled as triangular random variable with mean 1/A and mode 

0.8/a 



5 



M/D/l vs Heatrow data 



Us20D vs Heatrow data 



LHR 

TZ7A M/D/l 



Line length 
Us30D vs Heatrow data 
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Line length 
Us20S vs Heatrow data 
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Figure 3: Fit of empirical distribution from Heathrow data with the empirical 
distribution from simulations of M/D/ 1 model and some variants of the 
PSRA model. Subfigures a to d are displayed clockwise starting top left. 



5 ROBUSTNESS OF THE RESULT 

Simulations were also used to study the robustness of the PSRA with respect 
to the choice of the probability law fg of the delays. We considered delays of 
different type, namely uniform, triangular, normal and exponential, with zero 
mean and fixed standard deviation a. 

Figure [4] suggests that the standard deviation of the delay is the actual 
parameter of the model and that the resulting queue distribution is insensitive 
of the nature of the delays. For symmetrical probability laws such as uniform, 
triangular and normal the histograms in Figure [4] present an astonishing 
resemblance and differ for only small fluctuations. 

Even more surprisingly, we obtain the same layout with a skew probability 
law, like the exponential distribution. Figure [4] clearly shows that the queue 
distribution obtained using exponentially delayed arrivals is qualitatively 
the same as the uniformly delayed case. This circumstance is of extreme 
interest for applications, for the exact solution of the model can be obtained 
for the case of exponential delays. In |io] a PSRA model with exponentially 
distributed £'s is considered and an iterative method to derive the explicit 
expression of the generating function is provided. In the same paper it 
can be seen via simulations of the model that the empirical and theoretical 
distributions completely agree. 

6 CONCLUSIONS 

In this paper, we present a model to describe the inbound air traffic over a 
congested hub. The model can be also used to study the possible effects of 
the application of future technologies to the Air Traffic Control liT2l . 
This work is a preliminary attempt to study this important problem, as 
such it should motivate further studies about the PSRA and its performance 
analysis. Hopefully, the features of this arrival process may give also an 
insight in the equally crucial problem of congestion reduction in complex 
systems. 
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Triangular Law 




Figure 4: Robustness of the PSRA model with respect to the choice of the 
probability law fr of the delays. The delays have zero mean and fixed 
standard deviation o~ = 23. Clockwise from top left histogram we have 
uniformly, triangularly, exponentially, and normally distributed £'s. 
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